Rank in Wordlist | Frequency | Word |
---|---|---|
3756 | 124 | 1,5 |
4295 | 108 | 2,5 |
8203 | 55 | 3,5 |
11402 | 38 | 0,5 |
12555 | 34 | 1,3 |
12565 | 34 | 4,5 |
13564 | 31 | 1,2 |
13571 | 31 | 5,5 |
16087 | 25 | 1,7 |
16633 | 24 | 1,4 |
Rank in Wordlist | Frequency | Word |
---|---|---|
153105 | 1 | .) |
Rank in Wordlist | Frequency | Word |
---|---|---|
11937 | 36 | 50% |
12557 | 34 | 10% |
13893 | 30 | 20% |
14691 | 28 | 90% |
16634 | 24 | 100% |
18603 | 21 | 5% |
20201 | 19 | 80%-i |
21054 | 18 | 15% |
21075 | 18 | 30% |
21084 | 18 | 60%-i |
Rank in Wordlist | Frequency | Word |
---|---|---|
40046 | 8 | R&B |
117448 | 2 | R&D |
118180 | 2 | S&P500 |
165280 | 1 | A&E |
176211 | 1 | Bar&Ristorante” |
182659 | 1 | C&C |
182660 | 1 | C&L |
184529 | 1 | Cocktail&Lounge” |
185783 | 1 | D&B |
190700 | 1 | E&C |
Rank in Wordlist | Frequency | Word |
---|---|---|
159181 | 1 | 200-$300 |
162818 | 1 | 50000$-a |
165279 | 1 | A$AP |
Rank in Wordlist | Frequency | Word |
---|---|---|
500 | 780 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
19351 | 20 | .' |
44537 | 7 | Vega'nın |
55816 | 5 | I'm |
57527 | 5 | Woo-jin'in |
65107 | 4 | Can't |
67054 | 4 | O'Konnell |
67882 | 4 | Scholar's |
67884 | 4 | Scott'un |
74675 | 4 | single'ı |
78972 | 3 | Assassin's |
Rank in Wordlist | Frequency | Word |
---|---|---|
39250 | 8 | A2c+3c |
101069 | 2 | 10+10 |
114560 | 2 | Na+/K |
153606 | 1 | 1+2+1 |
153994 | 1 | 10+1 |
155756 | 1 | 150+180 |
157597 | 1 | 1915+105 |
158872 | 1 | 2+3+3 |
159640 | 1 | 21+26 |
160777 | 1 | 3+2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
8527 | 53 | https://www |
11045 | 40 | km/saat |
12881 | 33 | 1/4 |
14475 | 29 | km/s |
19354 | 20 | 1/3 |
19813 | 20 | m/s |
19833 | 20 | m³/s |
20191 | 19 | 2/3 |
23758 | 16 | m/san |
25584 | 14 | 1/8 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots